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4.6 Cache invalidation 



Since the SuperCore macro does not support snooping or cache coherency, the processor 
may need to invalidate some or all of the cache lines under program control. For example, 
the programmer may construct a program sequence in the global memory and then try to 
execute it. Also, a debugger program may plant a TRAP opcode in an instruction set. In 
some cases only a few lines will need to be flushed, in others all lines will be flushed. The 
instruction cache supports single line and whole array invalidation. A cache line is invali- 
dated by setting the line-valid bit to zero. At the same time the corresponding LRU bit will 
also be set to 0, indicating that the way-0 line will be over-written first. 

4.7 Cache Tags 



Each tag in the instruction cache is accessible directly in the local address region of the 
memory. Tags are accessed as 32-bit values. The instruction cache tags have the format 
shown in Table 1. 



Table 1 Instruction Cache Tag 



Bit # 


31—24 


23—22 


21 


20 


19—0 


Name 


R'SVD 


SB 


L 


V 


TAG 



Bit# 


Name 


Description 


31—24 


R'SVD 


Reserved. 


23—22 


SB 


Software breakpoint. Indicates the behavior of this cache line when 
used to assist handling a "software'' breakpoint. 
If 00: normal cache behavior and replacement algorithm; 
if 01 : use_once behavior; 
if 10: debugjock behavior; 
if 1 1 ; dont_use_once behavior. 


21 


L 


LRU. The least recently used bit governing which way will be replaced 
on the next cache miss. There is physically one bit. Reads of this bit in 
each way report the state of the bit. Writes of this bit in either way 
update the bit. 


20 


V 


Valid. Indicates whether or not this cache line contains valid data. 


19—0 


TAG 


Address tag. An access to the cache hits this line if the most signifi- | 
cant 20 bits of the address match this TAG and the V bit is set and the 
SB behavior is normal, use_once, or debug_lock. 



Multiprocessor software breakpoints on Pegasus are supported by two mechanisms. The 
two mechanisms are supported by two tag bits in the instruction cache called SB. The first 
mechanism (referred to as use_once) is intended for use by a target resident debug moni- 
tor (or RTOS debug task). This mechanism is relatively intrusive to code running on the 
SC140s even if that code is not intended to take a software breakpoint. However, there are 
no hardware restrictions on the number of software breakpoints. The second mechanism 
(referred to as dont_use_once) operates by locking lines in the cache that include break- 
Lucent Technologies - Proprietary (Restricted) 
Solely for authorized persons having a need to know 
Use pursuant to Company instructions 



82 



May 2000 Rev 1.1 



Lucent Technologies Inc. 



Instruction Cache 



point opcodes. This mechanism is non-intrusive to code that is not intended to take the 
software breakpoint. However, this mechanism is more restricted as far as the placement 
of software breakpoints. 

The SB bits are encoded as follows: 

■ 00 - normal instruction cache behavior and replacement algorithm. 

■ 01 - use_once behavior; service the first fetch of this address from 
the cache and then invalidate this cache line. 

■ 10 - debug Jock behavior; service all fetches to this address from 
the cache and do not allow replacement of this line. 

■ 1 1 - dont_use_once behavior; on the first fetch of this address, do 
not service the access from the cache; after the first fetch, switch 
behavior to debug_lock; do not change the valid state of this line. 

Further information on the use of the software breakpoint bits is found in the JTAG, 
EOnCE, and Scan section. 

4.8 Instruction Cache Internal Structure 



The instruction cache contains two single-port SRAM arrays for data and two single-port 
SRAM arrays for the tags. The valid and LRU bits are implemented separately from the tag 
since there is a requirement to reset them in a small number of cycles at reset and on a 
cache flush. 

Each data array has 128 rows and 256 columns in a standard format. The write drivers 
support independent update of the two 16-byte sectors in a cache line. When a cache line 
is updated the requested data is bypassed to the P-bus pins. The output data must be 
latched and only change as a result of a processor request. 

Each tag array has 128 rows, each row containing a 20-bit tag address field. On read, the 
tag bits pass directly to 20-bit comparators, that check for equality with the high 20-bits of 
the request address. If one of the tags matches the address and the valid bit is set, the 
data for the matching way is forwarded to the SC140. 

When a stall situation is detected, the Cache Manager will activate a state-machine to 
handle the cache fill operation. The timing constraints for these fill cycles will be relatively 
relaxed. 

A rough outline of a two-way set associative implementation is present below for illustra- 
tion only. 
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13.2.13 Intest 



The extest instruction causes a user-defined boundary scan register to be placed in the 
scan chain between the TDI and TDO signals. In addition, the user-defined scan chain 
interface signals are set to indicate test mode for the boundary scan register and that 
intest is present in the instruction register when this instruction is loaded into the JIR by 
transiting the Update-IR state. 

13.2.14 Bypass instruction 



The bypass instruction causes the bypass register flip-flop to be placed in the scan chain 
between the TDI and TDO signals. In addition, the boundary scan register is placed in the 
normal mode when this instruction is loaded into the JIR by transiting the Update-IR state. 
The bypass instruction is also selected if any reserved instruction is loaded into the JIR. 

1 3.2.1 5 Bypass register 



The bypass register is a 1 bit register. On traversing the Capture-DR state, the bypass reg- 
ister is loaded with 0. The bypass register passes its input data to its output signal after 
being clocked. 

13.3 Debug operations 



This section describes the JES support for certain common debug operations. The proce- 
dure described here is not intended to be the only means for implementing the operation 
and the debug host/debugger is free to use any means that work. 

13.3.1 Software instruction breakpoints 



Software instruction breakpoints for a single processor are traditionally implemented by 
overwriting the instruction at which a breakpoint is desired with a DEBUG or DEBUGEV 
opcode. This approach does not work if the instruction memory is not writable or in a mul- 
tiprocessor system. In a multiprocessor system, a software breakpoint may be intended for 
one or more, but not all, of the CPUs. In this case, a traditional software breakpoint would 
cause all of the CPUs to stop rather than just the intended one. The JES block addresses 
these problems with hardware support. This hardware support is in the form of extra tag 
bits in the instruction cache. These tag bits enable a debug host or target-resident debug 
monitor to select the level of intrusiveness of "software" breakpoints and sequence a CPU 
when it encounters such a breakpoint. 

Multiprocessor software breakpoints on Pegasus are supported by two mechanisms. The 
two mechanisms are supported by two tag bits in the instruction cache called swbpt[1 :0]. 
The first mechanism (referred to as use_once) is intended for use by a target resident 
debug monitor (or RTOS debug task). This mechanism is relatively intrusive to code run- 
ning on the SC140s even if that code is not intended to take a software breakpoint. How- 
Lucent Technologies - Proprietary (Restricted) 
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ever, there are no hardware restrictions on the number of software breakpoints. The 
second mechanism (referred to as dont_use_once) operates by locking lines in the cache 
that include breakpoint opcodes. This mechanism is non-intrusive to code that is not 
intended to take the software breakpoint. However, this mechanism is more restricted as 
far as the placement of software breakpoints. 

The swbpt bits are encoded as follows: 

■ 00 - normal instruction cache behavior and replacement algorithm. 

■ 01 - use_once behavior; service the first fetch of this address from 
the cache and then invalidate this cache line. 

■ 10 - debugjock behavior; service all fetches to this address from 
the cache and do not allow replacement of this line. 

■ 1 1 - dont_use_once behavior; on the first fetch of this address, do 
not service the access from the cache; after the first fetch, switch 
behavior to debugjock; do not change the valid state of this line. 

The use_once mechanism would typically be used with a target resident debug monitor or 
an RTOS debug task although it could also be implemented by a debug host. The basic 
operation is to set SC140 debug opcodes in shared main memory on instructions that 
have a software breakpoint in any SC140. Each SC140 takes a debug exception to their 
debug monitor/debug task when they encounter the debug opcode. The debug exception 
handler consults a table of software breakpoints in memory for this Pegasus and either 
gives control to the debugger user or resumes execution depending on if this software 
breakpoint is intended for this SC140 or not (respectively). 

In the case where an SC140 encountered a software breakpoint not intended for it, the 
swbpt bits makes it possible to "step" the breakpointed instruction in the SC140's instruc- 
tion cache, even if the instruction is in a noncacheable segment. Strictly speaking, the 
swbpt bits control whether or not a fetch address is serviced from the instruction cache 
even if that address is noncacheable. The cache line with the instruction to be stepped is 
placed in the cache and the swbpt bits set to use_once. Execution is resumed by fetching 
this address. After this access, the swbpt bits are change to normal cache behavior and 
that cache line is invalidated. The SC140 may be stalled during this change in the swbpt 
bits. The swbpt bits are by an LPB write to the instruction cache tag. The cache line with 
the swbpt bits set for use_once holds a line whose shared main memory copy contains 
debug opcodes. 

The following is an example of the algorithm that may be used by the debug exception 
handler (and possibly host debugger) to "step" the software breakpoint on SC140s that 
are not the intended targets of the breakpoint. In the following description, comments such 
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as // Host: may be implemented by the debugger host or the debug exception handler. 

// Host: Debug host writes a debug opcode to the shared memory line, using 
// a read-modify- write operation. 

// Host: Debug host configures eonce for exceptions on debugev opcodes, 
debugev exception handler { 

for (i=0; i<size of this core's bpt table; i++) { 

if ( "compare (break_address, bpt_table [i] . address) ) { 
// compare () can use hashing or other comparison operation, 
break; 

} 

break_this_core = FALSE; 

if (bpt_table [i] . idmask & this_core ' s_id) { 

break_this_core = TRUE; 
// exception handler configures eonce for debug mode 
//on debugev opcodes 

DEBUGEV 

// Host: Debug host user has control ... 
// ... resume 

} 

modif ied_line = * (prog addr *) (break_address) 
modif ied_line &= bpt_table [i] . debug_opcode_mask 
modif ied_line |= bpt_table [i] . opcode 

line_addr = LPB address of cache line for break_address 
*line_addr = modif ied_line 

if (break__this_core || noncacheable (break_address) ) { 
set swbpt bits to use_once 

} 

RTE // from debug exception handler 

} 

This algorithm requires a software breakpoint table for each SC140 in memory. The struc- 
ture of this table may be whatever is most efficient for the execution of the debug exception 
handler and use of the application memory. Semaphore access to this table is supported 
from both code running on the SC140 (through the BMTSET opcode) as well as from the 
debug host (through the JES block). The access to a specific cache line (based on physi- 
cal addresses) using a virtual address is supported by the Pegasus address translate reg- 
ister. The target of the RTE is the execution set in the cache line with swbpt set to 
use_once. If this is not true, the cache line with use_once will prevent other lines with the 
same cache index from being cached. A cache line with use_once is never replaced by 
the cache replacement hardware. 

The debug lock bit (debug_lock) mechanism is used when a more non-intrusive software 
breakpoint is required. Setting the swbpt bits to debugjock indicates that an access to 
this line should be served from the cache, even if the address is in a noncacheable seg- 
ment, and that this line should never be replaced. The line set to debugjock holds debug 
opcodes for breakpoints. When the swbpt bits are set to debugjock, this value is changed 
only by an LPB write to the instruction cache tag memory. 

When a breakpoint occurs for a line set to debugjock, the debug host or debug monitor 
then sets the line to dont_use_once to enable the actual instruction to be executed instead 
of the debug opcode. This programming of the swbpt bits causes the instruction cache to 
fetch the line from main memory rather than using the contents of the cache. Once the first 
access to this line has been observed, the cache control logic sets the swbpt bits to 
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// Sole property of Lucent Technologies Inc. containing its 

// Proprietary Confidential information. 

// 

// This unpublished source code is CONFIDENTIAL PROPRIETARY of 

// Lucent Technologies Inc. 

// 

// Project: PEGASUS 
// Block: ic_d 
// Name: ic_d.v 
// 

// Description: This is a block in the PEGASUS SuperCore ( tm) . The function 

// of this block is Instruction Cache Debug Mode 

// 

// Change List: 

// Date By Description 

// 

// 4/03/00 Han Q. Nguyen Ver 1.0 code structure 

// 6/22/00 Han Q. Nguyen Ver 2.0 code structure with GenComps Library 

// 

/***********************^ 
"include "../.. /timescale . v" 

module ic_d ( gclkw, pclk, resetb, psel_id, psel_it, prdata, pwdata, cm_inval, 
tag_addr, 

penable, paddr, pwrite, pwait, tag_rdata, tag_rdatb, tag_wdat, 

dtag_ad, 

tag_we_a, tag_we_b, dat_rdata, dat_rdatb, dat_wdat, ddat_ad, dat_bw, 

dat_we_a , 

dat_we_b, icd_stall, pmem_rd, di_bg_hip, lpbgnt, va_in, va^out, 

clear_v, 

vb_in, vb_out, lrui, lruo, updf_a, updf_b, updl, upd_sb_a, upd_sb_b, 

sb_i, sb_a, 

sb_b, cm_icfa, cm_icfl, cm_clva, dbg_dont_use_one, ysenl, ysenO 

); 

input gclkw; // global system clock 

input pclk; // LPB clock 

input resetb; // global pegasus reset 

input psel_id; // Debug mode chipselect from BG 

input psel_it; // Debug mode chipselect from BG 

output [31:0] prdata; // LPB read data 

input [31:0] pwdata; // LPB write data 

input cm_inval; // command from CM saying clear valid bit on 

write 

input [31:12] tag_addr; // latched TAG addr 

input penable; // LPB Enable signal 

input [31:0] paddr; // LPB address 

input pwrite; // LPB write signal 

output pwait; // LPB wait signal 

input [19:0] tag__rdata; // TAG Write data 

input [19:0] tag__rdatb; // TAG Write data 

output [19:0] tag_wdat; // TAG Read data 

output [6:0] dtag_ad; // TAG Address 

output tag_we_a; // TAG Address 

output tag_we_b; // TAG Address 



input 

input 

output 

output 

output 

output 

output 

output 

input 

output 

input 

input 

input 

input 

output 

output 

output 

output 

output 

output 



[255:0] dat_rdata; // DATA Write data 

[255:0] dat__rdatb; // DATA Write data 

[127:0] dat_wdat; // DATA Read data 

[255:0] datjDW; // DATA Address 

[6:0] ddat_ad; // DATA Address 

dat_we_a; // data write enable la 

dat we b; // data write enable 2b 



icd_stall; 
pmem_rd ; 
lpbgnt ; 
di_bg_hip ; 
va_out ; 
vb_out ; 
Iruo; 
va_in; 
vb_in; 
lrui; 
updf_a; 
updf _b ; 
updl ; 



output upd_sb_a; 
output upd_sb_b; 



output [1:0] sb_i; 
input [1:0] sb_a; 
input [1:0] sb_b; 
input cm_icfa; 
input cm_icfl; 
input [31:0] cm_clva; 
dump/ res tore 
output clear_v; 
output dbg_dont_use_one; 
input ysenO; 
input ysenl; 



// stall SC when access IC in debug mode 
// program read strobe from sc 
// Bus grant for LPB, use for controlling MUX 
// high priority signal for LPB 
// Valid flag A info 
// Valid flag B info 
// LRU flag info 
// Valid flag A data 
// Valid flag B data 
// LRU flag data 
// update valid flag 
// update valid flag 
// update LRU flag 
// update SB flag 
// update SB flag 

// SB input data 
// SB output data 
// SB output data 
flush all lines 
flush this line 



// 
// 
// 



(set all valid bits to zero) 
(set valid bit to zero) 



address of 32 -bit data in cache to 



// Clear Valid flag 

// don't use once flag 

// clock control by scan 
// clock control by scan 



parameter 
RW_DONE = 
parameter 
parameter 



P_IDLE = 3'bOOO, PWS = 3'bOOl, P_WAIT = 3'bOlO, RW_ACC = 3'bOll, 
'bl00; 

IDLE = 
NORM = 



2'b00, P_REQ = 2'bOl, P_GNT = 2'blO, 
3'bOOO, USE_ONCE = 3'bOOl, DBG_LCK = 



P_ACK = 2 'bll; 
3 'bOlO, 



DONT_ 


_USE_ONCE 


= 3'blOO, 








FLUSH_ALL = 3'blOO, 


FLUSH LINE = 3 'blOl; 


wire 


[2:0] 


lpb_p_state; 




// present state 


reg 


[2:0] 


lpb_n_state; 




// next state 


wire 


[1:0] 


arb_p_state; 




// present state 


reg 


[1:0] 


arb_n_state; 




// next state 


wire 


[2:0] 


dbgj>_state; 




// present state 


reg 


[2:0] 


dbg_n_state; 




// next state 


wire 


reset / 


// internal reset 


wire 


[31:0] 


ictag; 


// 


TAG data 


reg 


[31:0] 


icdat ; 


// 


DAT data 


reg 


[255:0] 


dat_bw; 


// 


DATA Address 


reg 


[255:0] 


dat_wdati; 


// 


DATA Read data 


wire 


[255:0] 


dat_rdat ; 


// 


DATA Read data 


wire 


[19:0] tag_rdat; 


// 


TAG Read data 



wire cm_icfa_d; 



// flush all lines (set all valid bits to zero) 



// flush this line (set valid bit to zero) 
// flush all lines (set all valid bits to zero) 
// flush this line (set valid bit to zero) 
// 32 bit zero 
// 32 bit zero 



// 
// 



// 



LPB request 
LPB grant 
// LPB ack 
stall SC 

select line L,H 
select way A,B 



// 
// 



// select line L,H 



wire cm_icfl_d 
wire cm_icfa_p 
wire cni_icfl_j? 
wire [31:0] zero32; 
wire [31:0] one32; 
wire lpb_req; 
wire lpb_gnt; 
wire lpb_ack; 
wire stall; 
wire sel_line; 
wire sel_ab; 
wire v__ab; 
wire line_a, line_b; 
wire dbg_use_one; 
wire dbg_lock; 
wire dbg_f lush_line; 
wire dbg_f lush__all ; 
wire sb_stalli / d_stall; 
reg sb_stall; 

wire [1:0] sb; // SB output data 

wire clkl__0, clkl__l, pclkl_0; 

wire qualO; // clock qual input 

wire quail; // clock qual input 

// 



// Qualifier element 
// 



phgenDFFl Ul_phgen ( . CLK (gclkw) , . QUAL (quail) , . YSEN (ysenl) , 
.QCLK(clkl_0) ) ; 

phgenDFFO U2_phgen (. CLK (gclkw) , .QUAL (qualO) , . YSEN (ysenO ) , 
.QCLK(clkl_l) ) ; 

phgenDFFl U3_phgen (.CLK(pclk), .QUAL (quail) , . YSEN (ysenl) , 
.QCLK(pclkl_0) ) ; 

// 



// Misc. signals 

// 



assign qualO = l'bl; 
assign quail = l'bl; 
assign reset = -resetb; 
assign zero32 = 32'bO; 

assign one32 = 32^11111111111111111111111111111111; 

assign lpbgnt = lpb_gnt; 

assign sel_line = paddr[4]; 

assign sel_ab = lruo; 

// assign sel_ab = paddr[12]; 

assign v_ab = (sel_ab == l'bl)? vb_out : va_out; 
assign sb = (sel_ab == l'bl)? sb_b : sb_a; 



// 



// Arbitor signal 



// 



assign lpb_req = (lpb_p_state == PWS) ; 
assign lpb_gnt = (arb_p_state == P_GNT) / 
assign lpb_ack = (lpb_p_state == RWJDONE) ; 



// 



// LPB slave Wait, LPB stall SC 
// 



assign pwait = (lpb_n_state == PWS | | lpb_p_state == PWS | | lpb__p_state == 
PJtfAIT) ; 

assign stall = ( (arb_n_state == PJ3NT) | | (arbjp_state == P_GNT) ) ; 
assign d_stall = stall | sb_stalli; 

DFFOR # ( 1) U1_DFF1 ( . D (d_stall) , . QCLK (clkl_l) , .RB(resetb), 
.Q(icd_stall) ) / 



// 

// LPB slave write enable 
// 



assign tag_we__a = ( (lpb_p_state == RW_DONE) && psel_it && pwrite && 
!sel_ab && penable && lpb_gnt) ; 

assign tag_we_b = ( (lpb_p_state == RW_DONE) && psel_it && pwrite && sel_ab 
&& penable && lpb_gnt) ; 

assign dat_we_a = ( (lpb_p_state == RW_DONE) && psel_id && pwrite && 
!sel_ab && penable && lpb_gnt) / 

assign dat_we_b = ( (lpb_p_state == RW__DONE ) && psel_id pwrite && sel_ab 
&& penable lpb_gnt) ; 



// 



// Array update enable 

// 



assign updf_a = (lruo & dbg_use_one & !lpb_gnt) | (tag_we_a) | (line_a & 
dbg_f lush_line & !lpb_gnt); 

assign updf_b = (-lruo & dbg_use_one & !lpb_gnt) | (tag_we_b) | (line_b & 
dbg_f lush_line & !lpb_gnt); 

assign updl = tag_we_a | tag_we_b; 

assign clear_v = dbg_use_one | dbg_flush_line; 



// 



// mux selecting 32 -bit word from data array 

// 



always @ ( paddr or dat_rdat) 
begin 

case ( paddr [4:2] ) 
3 'bOOO: 

icdat = dat__rdat [31 :0] / 
3'bOOl: 

icdat = dat_rdat [63 :32] ; 



3 


•bOlO: 










icdat 


= 


dat_ 


_rdat [95 :64] ; 


3 


•bOll: 










icdat 


= 


dat_ 


rdat [127:96] ; 


3 


■blOO : 










icdat 


= 


dat_ 


rdat [159:128] 


3 


■blOl: 










icdat 


— 


dat_ 


rdat [191:160] 


3 


■bllO: 










icdat 




dat_ 


_rdat [223 : 192] 


3 


•bill : 










icdat 




dat_ 


rdat [255:224] 


default : 










icdat 




dat_ 


rdat [31:0] ; 



endcase 

end 

// 



// TAG data for LPB 

// 



assign dat_rdat ~ (paddr[12] == l'bl)? dat_rdatb : dat_rdata; 
assign tag_rdat = (paddr[12] == l'bl)? tag_rdatb : tag_rdata; 
assign ictag = {8'bO, sb, lruo, v_ab, tag_rdat}; 

// 



// LPB read data. Muxing Tag or Dat array 
// 



assign prdata = ( (lpb_p_state == RWJDONE) && penable && psel_id) ? icdat : 

32 'bz; 

assign prdata = ( (lpb_jp_state == RW_DONE) && penable && psel_it) ? ictag : 

32 'bz; 

// 



// TAG write data into array 

// 



assign tag_wdat [19 : 0] = pwdata [19 : 0] ; 
assign va_in = pwdata [20]; 
assign vb__in = pwdata [20]; 
assign lrui = pwdata [21]; 

// 



// mux selecting 32 -bit word write data into array 
// 



always @ (paddr or pwdata or dat_wdati) 
begin 

case ( paddr [4:2] ) 
3 'b000 : 

dat_wdati = {dat__wdati [255 : 32] , pwdata} ; 
3'bOOl: 



dat_wdati = {dat_wdati [255 : 64] , pwdata, dat_wdati [31 : 0] } ; 
3 *b010: 

dat_wdati = {dat_wdati [255 : 96] , pwdata, dat_wdati [63 : 0] } ; 
3 »b011: 

dat_wdati = {dat_wdati [255 : 128] , pwdata, dat_wdati [95 : 0] } ; 
3'blOO: 

dat_wdati = {dat_wdati [255 : 160] , pwdata, dat_wdati [127 : 0] } 
3'blOl: 

dat_wdati = {dat_wdati [2 55 : 192] , pwdata, dat_wdati [159 : 0] } 
3'bllO: 

dat_wdati = {dat_wdati [255 : 224 ] , pwdata, dat_wdati [191 : 0] } 
3'blll: 

dat_wdati = {pwdata, dat_wdati [224 :0] } ; 
default : 

dat_wdati = 256' bO; 
endcase 

end 

assign dat_wdat = (sel_line == l'bl)? dat_wdati [255 : 128] : 
dat_wdati [127 : 0] ; 

// 



// mux selecting 32 -bit word BW enable 
// 



always @ ( paddr or one32 or dat_bw) 
begin 

case ( paddr [4: 2] ) 
3 'bOOO : 

dat_bw = {dat_bw[255:32] , one32}; 
3'bOOl: 

datjDW = {dat_bw[255:64] , one32, dat_bw [3 1 : 0] } ; 
3'bOlO: 

dat_bw = {dat_bw[255:96] , one32, dat_bw [63 : 0] } ; 
3'bOll: 

dat_bw = {dat_bw[255:128] , one32, dat_bw [95 : 0] } ; 
3'blOO: 

dat_bw = {dat_bw[255 :160] , one32, dat_bw [127 : 0] } ; 
3'blOl: 

dat_bw = {dat_bw[255 :192] , one32, dat_bw [159 : 0] } ; 
3'bllO: 

dat_bw = {dat_bw[255 :224] , one32, dat_bw [191 : 0] } ; 
3' bill: 

dat_bw = {one32, dat_bw [223 : 0] } ; 
default : 

dat_bw = 256' bO; 
endcase 

end 

// 



// LPB slave address 
// 



assign ddat_ad = (lpb_gnt == l'bl)? paddr [11: 5] : 7'bO; 

assign dtag_ad = (lpb_gnt l'bl)? paddr[ll:5] : cm_clva [11 : 5] ; 



// 



// Sequential state transition LPB state machine 

// 



DFF1R #(3) U7JDFF1 ( . D (lpb_n_state) , . QCLK (pclkl_0 ) , .RB(resetb), 
.Q (lpb_p_state) ) ; 

// 



// Comb logic LPB state stranstion Block 

// 



always @ ( lpb_j>_state or lpb_gnt or psel_it or psel_id ) 
begin 

case ( lpb__p_state ) 
P_IDLE: 
begin 

if { (psel_it | | psel_id) ) 
begin 
if ( ! lpb_gnt ) 

lpb_n_state = PWS; 
else 

lpb_n_state = RW_ACC; 

end 
else 

lpb_n_state = P_IDLE; 

end 
PWS: 

lpb_n_state = P_WAIT; 
P_WAIT: 
begin 

if ( lpb_gnt ) 

lpb_n_state = RW_ACC; 
else 

lpb_n_state = P_WAIT; 

end 
RW_ACC : 

lpb_n_state = RW_DONE; 
RW_DONE : 

lpb_n_state = P_IDLE; 
default: 

lpb_n_state = P_JDLE; 

endcase 

end 

// 



// Sequential state transition Arbitor state machine 

// 



DFF1R #(2) U2_DFF1 ( . D (arb_n_state) , . QCLK (clkl_0 ) , .RB(resetb), 
. Q (arb_p_state) ) ; 



// 



// Comb logic Arbitor state stranstion Block 

// 



always @ ( arb_j>__state or pmem_rd or lpb_ack or lpb_req or di_bg_hip) 
begin 

case ( arb_p_state ) 
IDLE: 
begin 

if ( lpb_req ) 

arb__n_state = P_REQ ; 
else 

arb_n_state = IDLE; 

end 
P_REQ : 
begin 
if ( ! lpb_req) 
begin 
if (di_bg_hip) 

arb_n_state = P_GNT; 
else 
begin 
if ( !pmem_rd) 

arb_n_state = PJ3NT; 
else 

arb_n_state = P_REQ; 
end 
end 
else 

arb_n_state = P_REQ; 

end 
P_GNT : 

if ( lpb_ack ) 

arb_n_state = P_ACK; 
else 

arb_n_state = PJ3NT; 
P_ACK : 

arb_n_state = IDLE; 
default : 

arb_n_state = IDLE; 
endcase 

end 



// 



// Sequential state transition DBG state machine 
// 



DFF1R #(3) U3_DFF1 ( . D (dbg_n_state ) , . QCLK (clklj) ) , .RB(resetb), 
. Q (dbg_p_state) ) ; 

// 



// Comb logic DBG state stranstion Block 
// NORM = 3 1 bO 00, USEJDNCE = 3'bOOl, DBG_LCK = 3'bOlO, DONT_USE_ONCE = 
3'bOll, 

// FLUSH_ALL = 3»bl00, FLUSH_LINE; 



// 



always @ ( dbg_p_state or sb or cm_icfa_p or cm_icfl_p or lpb_gnt or 
cm_inval) 

begin 

case ( dbg_p_state ) 
NORM : 
begin 
if (!lpb_gnt) 
begin 
if (sb == 2 »b01) 

dbg_n_state = USE_ONCE; 
else if (sb == 2 'blO) 

dbg_n_state = DBG_LCK; 
else if (sb == 2 'bll) 

dbg_n_state = DONT_USE_ONCE ; 
else if (cm_icfl_p | cm_inval) 

dbg_n_state = FLUSH_LINE; 
else if (cm__icf a_p) 

dbg_n_state = FLUSH_ALL ; 
else 

dbg_n_state = NORM; 

end 
else 

dbg_n_state = NORM; 

end 
USEJDNCE : 
begin 



dbg_n_state 


= NORM; 


end 




DBG LCK: 




if ( sb == 2 'bOO ) 


dbg_n_state 


= NORM; 


else 




dbg__n_state 


= DBG_LCK; 


DONT_USE_ONCE : 




dbg_n_state 


= DBG__LCK; 


FLUSH_ALL : 




dbg_n_state 


= NORM; 


FLUSH_LINE: 




dbg_n_state 


= NORM; 


default : 




dbg_n_state = 


NORM; 



endcase 

end 

// 



// Comparator for tag match 

// 



assign line_a = ( ( tag__rdata [19 : 0] == cm_clva [31 : 12] ) || 

(tag_rdata [19:0] == tag__addr [3 1 : 12 ] ) ) ? l'bl : 1'bO; 

assign line_b = ( (tag_rdatb [19 : 0] == cm_clva [31 : 12] ) || 

(tag_rdata[19:0] == tag_addr [31 : 12] ) ) ? l'bl : 1'bO; 



// 



// control logics for debug SB states 

// 



assign dbg_use_one = (dbg_p_state == USE_ONCE) ; 

assign dbg_lock = (dbg_p_state == DBG_LCK) ; 

assign dbg_dont_use_one = (dbg_p_state == DONT_USE_ONCE) ; 

assign dbg_f lush_line = (dbg_p_state == FLUSH_LINE) ; 

assign dbg_f lush_all = (dbg_p_state == FLUSH_ALL) / 



// 



// stall SC for clearing SB bits in use-Once mode 

// 



assign sb_stalli = (dbg_n_state == USE_ONCE | | dbg_n_state == FLUSH_LINE 
I I dbg_n_state == FLUSH_ALL) ; 

// 



// Mux for SB bits 

// sb_i[l:0] = pwdata [23 :22] ; 

// 



reg [1:0] sb_i; // SB input data 

always @ (lpb_gnt or dbg_use_one or dbg_dont_use__one or pwdata) 
begin 

case ( {lpb_gnt, dbg__dont_use_one, dbg_use_one} ) 
3'bOOl: 

sb_i [1:0] = 2 'b00; 
3 f b010: 

sb_i [1:0] = 2 «bl0; 
3 «bl00: 

Sb_i[l:0] = pwdata [23 : 22] ; 
default : 

sb_i[l:0] = 2'b00; 
endcase 

end 

// 



// Update SB array 

// 



assign upd_sb_a = (dbg_use_one & ! lpb_gnt & lruo) | (dbg_dont_use_one & 
!lpb_gnt & lruo) | tag_we_a; 

assign upd_sb_b = (dbg_use_one & ! lpb__gnt & !lruo) | (dbg_dont_use_one & 
!lpb_gnt Sc !lruo) | tag_we_b; 



// 



// flush pulse 
// 



DFF1R # ( 1) U4_DFF1 ( . D (cm_icf a) , . QCLK (clkl_0) , .RB(resetb) 
. Q (cm_icf a_d) ) ; 

DFF1R #(1) U5_DFF1 ( .D (cm_icf 1) , . QCLK (clkl_0 ) , .RB(resetb) 
.Q(cm_icf l_d) ) ; 

assign cm_icfa_p = ciri_icfa & ~cm_icf a_d; 
assign cm__icfl_p = cm_icfl & ~cm_icfl_d; 



endmodule 



# Directory of: sync://star.agere.com:2645/Projects/Peg/chip/peglP/ic/src/ic__d.v; 
# 

# fo— 9/07/2000 13:58:38 ic_d.v;1.1 



import -version 1.1 sync://star.agere.com:2645/Projects/Peg/chip/peglP/ic/src 
ic_d.v 

#* import -version 1.1 sync://star.agere.com:2645/Projects/Peg/chip/peg!P/ic/src 
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# icjlv: Success Imported 



